56 research outputs found

    Accelerating data-intensive scientific visualization and computing through parallelization

    Get PDF
    Many extreme-scale scientific applications generate colossal amounts of data that require an increasing number of processors for parallel processing. The research in this dissertation is focused on optimizing the performance of data-intensive parallel scientific visualization and computing. In parallel scientific visualization, there exist three well-known parallel architectures, i.e., sort-first/middle/last. The research in this dissertation studies the composition stage of the sort-last architecture for scientific visualization and proposes a generalized method, namely, Grouping More and Pairing Less (GMPL), for order-independent image composition workflow scheduling in sort-last parallel rendering. The technical merits of GMPL are two-fold: i) it takes a prime factorization-based approach for processor grouping, which not only obviates the common restriction in existing methods on the total number of processors to fully utilize computing resources, but also breaks down processors to the lowest level with a minimum number of peers in each group to achieve high concurrency and save communication cost; ii) within each group, it employs an improved direct send method to narrow down each processor’s pairing scope to further reduce communication overhead and increase composition efficiency. The performance superiority of GMPL over existing methods is evaluated through rigorous theoretical analysis and further verified by extensive experimental results on a high-performance visualization cluster. The research in this dissertation also parallelizes the over operator, which is commonly used for α-blending in various visualization techniques. Compared with its predecessor, the fully generalized over operator is n-operator compatible. To demonstrate the advantages of the proposed operator, the proposed operator is applied to the asynchronous and order-dependent image composition problem in parallel visualization. In addition, the dissertation research also proposes a very-high-speed pipeline-based architecture for parallel sort-last visualization of big data by developing and integrating three component techniques: i) a fully parallelized per-ray integration method that significantly reduces the number of iterations required for image rendering; ii) a real-time over operator that not only eliminates the restriction of pre-sorting and order-dependency, but also facilitates a high degree of parallelization for image composition. In parallel scientific computing, the research goal is to optimize QR decomposition, which is one primary algebraic decomposition procedure and plays an important role in scientific computing. QR decomposition produces orthogonal bases, i.e.,“core” bases for a given matrix, and oftentimes can be leveraged to build a complete solution to many fundamental scientific computing problems including Least Squares Problem, Linear Equations Problem, Eigenvalue Problem. A new matrix decomposition method is proposed to improve time efficiency of parallel computing and provide a rigorous proof of its numerical stability. The proposed solutions demonstrate significant performance improvement over existing methods for data-intensive parallel scientific visualization and computing. Considering the ever-increasing data volume in various science domains, the research in this dissertation have a great impact on the success of next-generation large-scale scientific applications

    Fair Attribute Completion on Graph with Missing Attributes

    Full text link
    Tackling unfairness in graph learning models is a challenging task, as the unfairness issues on graphs involve both attributes and topological structures. Existing work on fair graph learning simply assumes that attributes of all nodes are available for model training and then makes fair predictions. In practice, however, the attributes of some nodes might not be accessible due to missing data or privacy concerns, which makes fair graph learning even more challenging. In this paper, we propose FairAC, a fair attribute completion method, to complement missing information and learn fair node embeddings for graphs with missing attributes. FairAC adopts an attention mechanism to deal with the attribute missing problem and meanwhile, it mitigates two types of unfairness, i.e., feature unfairness from attributes and topological unfairness due to attribute completion. FairAC can work on various types of homogeneous graphs and generate fair embeddings for them and thus can be applied to most downstream tasks to improve their fairness performance. To our best knowledge, FairAC is the first method that jointly addresses the graph attribution completion and graph unfairness problems. Experimental results on benchmark datasets show that our method achieves better fairness performance with less sacrifice in accuracy, compared with the state-of-the-art methods of fair graph learning. Code is available at: https://github.com/donglgcn/FairAC

    Rolling bearing fault diagnosis by a novel fruit fly optimization algorithm optimized support vector machine

    Get PDF
    Based on the nonlinear and non-stationary characteristics of rotating machinery vibration, a FOA-SVM model is established by Fruit Fly Optimization Algorithm (FOA) and combining the Support Vector Machine (SVM) to realize the optimization of the SVM parameters. The mechanism of this model is imitating the foraging behavior of fruit flies. The smell concentration judgment value of the forage is used as the parameter to construct a proper fitness function in order to search the optimal SVM parameters. The FOA algorithm is proved to be convergence fast and accurately with global searching ability by optimizing the analog signal of rotating machinery fault. In order to improve the classification accuracy rate, built FOA-SVM model, and then to extract feature value for training and testing, so that it can recognize the fault rolling bearing and the degree of it. Analyze and diagnose actual signals, it prove the validity of the method, and the improved method had a good prospect for its application in rolling bearing diagnosis

    Research on unsteady aerodynamic performance of last stage for low pressure cylinder of steam turbine

    Get PDF
    The last stage of the turbine low pressure cylinder has a complex flows, especially under the high mass flow and low mass flow conditions, where there is an obvious unsteady flow phenomena and obvious rotor-stator interaction. In order to further research on the flow of the last stage for the low pressure cylinder, the last stage blade of low pressure cylinder of 150 MW steam turbine is taken as the research object. Unsteady numerical simulation on the aerodynamic characteristics of the last stage blade for the steam turbine is conducted under a series of mass flow conditions (high mass flow condition, design condition and low mass flow condition) using the commercial CFD software ANSYS-CFX. As a result, rotor-stator interaction is most obvious in the root of the blade, followed by the midspan region, it is weaker in tip of the blade. Comparing internal flow details of the last stage blade in the three conditions, stator flow export and rotor blade passage have a larger energy loss under the low mass flow condition, and the flow is relatively smooth under the high mass flow and design conditions. Finally, energy conversion efficiency of the last stage blade is the highest under the design condition, followed by the high mass flow, and energy conversion efficiency of the last stage blade is the lowest under the low mass flow condition

    Trustworthy Representation Learning Across Domains

    Full text link
    As AI systems have obtained significant performance to be deployed widely in our daily live and human society, people both enjoy the benefits brought by these technologies and suffer many social issues induced by these systems. To make AI systems good enough and trustworthy, plenty of researches have been done to build guidelines for trustworthy AI systems. Machine learning is one of the most important parts for AI systems and representation learning is the fundamental technology in machine learning. How to make the representation learning trustworthy in real-world application, e.g., cross domain scenarios, is very valuable and necessary for both machine learning and AI system fields. Inspired by the concepts in trustworthy AI, we proposed the first trustworthy representation learning across domains framework which includes four concepts, i.e, robustness, privacy, fairness, and explainability, to give a comprehensive literature review on this research direction. Specifically, we first introduce the details of the proposed trustworthy framework for representation learning across domains. Second, we provide basic notions and comprehensively summarize existing methods for the trustworthy framework from four concepts. Finally, we conclude this survey with insights and discussions on future research directions.Comment: 38 pages, 15 figure

    Rolling bearing fault diagnosis by a novel fruit fly optimization algorithm optimized support vector machine

    Get PDF
    Based on the nonlinear and non-stationary characteristics of rotating machinery vibration, a FOA-SVM model is established by Fruit Fly Optimization Algorithm (FOA) and combining the Support Vector Machine (SVM) to realize the optimization of the SVM parameters. The mechanism of this model is imitating the foraging behavior of fruit flies. The smell concentration judgment value of the forage is used as the parameter to construct a proper fitness function in order to search the optimal SVM parameters. The FOA algorithm is proved to be convergence fast and accurately with global searching ability by optimizing the analog signal of rotating machinery fault. In order to improve the classification accuracy rate, built FOA-SVM model, and then to extract feature value for training and testing, so that it can recognize the fault rolling bearing and the degree of it. Analyze and diagnose actual signals, it prove the validity of the method, and the improved method had a good prospect for its application in rolling bearing diagnosis

    Capacitance-Based Frequency Adjustment of Micro Piezoelectric Vibration Generator

    Get PDF
    Micro piezoelectric vibration generator has a wide application in the field of microelectronics. Its natural frequency is unchanged after being manufactured. However, resonance cannot occur when the natural frequencies of a piezoelectric generator and the source of vibration frequency are not consistent. Output voltage of the piezoelectric generator will sharply decline. It cannot normally supply power for electronic devices. In order to make the natural frequency of the generator approach the frequency of vibration source, the capacitance FM technology is adopted in this paper. Different capacitance FM schemes are designed by different locations of the adjustment layer. The corresponding capacitance FM models have been established. Characteristic and effect of the capacitance FM have been simulated by the FM model. Experimental results show that the natural frequency of the generator could vary from 46.5 Hz to 42.4 Hz when the bypass capacitance value increases from 0 nF to 30 nF. The natural frequency of a piezoelectric vibration generator could be continuously adjusted by this method

    Quantitative local structure determination in mica crystals: ab initio simulations of polarization XANES at the potassium K-edge.

    Get PDF
    An attempt to refine the local structure of a layered structure such as mica is made by combining angle-resolved XANES (AXANES) and single-crystal X-ray diffraction (SC-XRD) experiments. Ab initio calculations of AXANES spectra of several tri-octahedral micas have been used to further interpolate experimental data and to deduce physico/chemical effects. Structural distortions have been found highly correlated with the compositional disordering that arises from electronic interactions between anions and cations, and extend the interlayer entering deep into nearby tetrahedral and octahedral sheets. Multiple occupations at the same atomic site have been investigated in detail both in the parallel and perpendicular components of AXANES spectra. Finally, the best fit obtained, computed in the framework of the multiple-scattering theory, is presented and the limitations of the muffin-tin potential in layered systems are briefly discussed

    Search for dark matter produced in association with bottom or top quarks in √s = 13 TeV pp collisions with the ATLAS detector

    Get PDF
    A search for weakly interacting massive particle dark matter produced in association with bottom or top quarks is presented. Final states containing third-generation quarks and miss- ing transverse momentum are considered. The analysis uses 36.1 fb−1 of proton–proton collision data recorded by the ATLAS experiment at √s = 13 TeV in 2015 and 2016. No significant excess of events above the estimated backgrounds is observed. The results are in- terpreted in the framework of simplified models of spin-0 dark-matter mediators. For colour- neutral spin-0 mediators produced in association with top quarks and decaying into a pair of dark-matter particles, mediator masses below 50 GeV are excluded assuming a dark-matter candidate mass of 1 GeV and unitary couplings. For scalar and pseudoscalar mediators produced in association with bottom quarks, the search sets limits on the production cross- section of 300 times the predicted rate for mediators with masses between 10 and 50 GeV and assuming a dark-matter mass of 1 GeV and unitary coupling. Constraints on colour- charged scalar simplified models are also presented. Assuming a dark-matter particle mass of 35 GeV, mediator particles with mass below 1.1 TeV are excluded for couplings yielding a dark-matter relic density consistent with measurements
    • 

    corecore